A Survey on Malleability Solutions for High-Performance Distributed Computing

نویسندگان

چکیده

Maintaining a high rate of productivity, in terms completed jobs per unit time, High-Performance Computing (HPC) facilities is cornerstone the next generation exascale supercomputers. Process malleability presented as straightforward mechanism to address that issue. Nowadays, vast majority HPC are intended for distributed-memory applications based on Message Passing (MP) paradigm. For this reason, many efforts Interface (MPI), de facto standard programming model. Malleability aims rescale executions on-the-fly, other words, reconfigure number and layout processes running applications. involves resources reallocation within system, handling application, redistributing data among those resume execution. This manuscript compiles how different frameworks process malleability, their main features, integration resource management systems, they may be used user codes. paper detailed state-of-the-art devised an entry point researchers who interested malleability.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A survey on impact of cloud computing security challenges on NFV infrastructure and risks mitigation solutions

Increased broadband data rate for end users and the cost of resource provisioning to an agreed SLA in telecom service providers, are forcing operators in order to adhere to employment Virtual Network Functions (VNF) in an NFV solution. The newly 5G mobile telecom technology is also based on NFV and Software Define Network (SDN) which inherit opportunities and threats of such constructs. Thus a ...

متن کامل

Communication Constructs for High Performance Distributed Computing

This paper is concerned with the impact of languagelevel communication mechanisms on the overall performance of distributed systems. Analytic models are developed of several existing mechanisms, including RPC and several enhancements. Using this modelling, a novel construct, called responsibilities, is synthesized. Analysis of the construct, guiding refinement of the initial design, demonstrate...

متن کامل

DISCWorld: A Distributed High Performance Computing Environment

An increasing number of science and engineering applications require distributed and parallel computing resources to satisfy user response-time requirements. Distributed science and engineering applications require a high performance “middleware” which will both allow the embedding of legacy applications as well as enable new distributed programs, and which allows the best use of existing and s...

متن کامل

A survey on resource allocation in high performance distributed computing systems

An efficient resource allocation is a fundamental requirement in high performance computing (HPC) systems. Many projects are dedicated to large-scale distributed computing systems that have designed and developed resource allocation mechanisms with a variety of architectures and services. In our study, through analysis, a comprehensive survey for describing resource allocation in various HPCs i...

متن کامل

A Java Framework for Distributed High Performance Computing

The past few years have dramatically changed the view of high performance applications and computing. While traditionally such applications have been targeted towards dedicated parallel machines, we see the emerging trend of building \meta-applications" composed of several modules that exploit heterogeneous platforms and employ hybrid forms of parallelism. In particular, Java has been recognize...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app12105231